Fix potential unicode conversion issues for *nix #7506

tex3d · 2025-06-03T05:42:04Z

There were multiple issues with Unicode conversion on *nix platforms. This PR fixes issues I found with the conversion functions that were causing failures when running locally, due to issues with setting the locale. It also had incorrect behavior for emulating the MultiByteToWideChar API.

This change makes the local setting thread safe and more robust to different available locales in runtime environments.

I fixed some off-by-one issues related to null termination, and eliminated some extra copies caused by detecting a string length, then passing the size without the null-terminator to a function which then had to copy the input string again to guarantee null-termination.

The CompilerTest::CompileWithEncodeFlagTestSource test has minor updates for clarity and an added scenario.

The changed code passes the Unicode tests now without asserting across all platforms.
This change should have no functional impacts, except eliminating potential double-null-termination in some cases, and catching more error conditions.

llvm-beanz

I think this is a more robust solution to the problem #7458 is trying to address.

@tex3d can you confirm?

tex3d · 2025-07-11T22:59:49Z

I think this is a more robust solution to the problem #7458 is trying to address.

@tex3d can you confirm?

Yeah, this one fixes a number of other (potential) issues too.

bogner · 2025-09-10T20:52:30Z

lib/DxcSupport/Unicode.cpp

+  if (cbMultiByte == 0 || cbMultiByte < -1 || cbMultiByte > (INT32_MAX - 1) ||
+      cchWideChar < 0 || cchWideChar > (INT32_MAX - 1)) {


Isn't cbMultiByte > (INT32_MAX - 1) equivalent to cbMultiByte == INT32_MAX? Unless we were to change cbMultiByte and cchWideChar to larger types I think the equality check is clearer, no?

igaryhe · 2025-10-11T09:52:59Z

Hello, I've tested this PR and it indeed fixes several test failure for me. However, I cannot get any output by running dxc -help or dxc --version. What could be wrong?

igaryhe · 2025-10-11T16:04:56Z

lib/DxcSupport/Unicode.cpp

 bool WideToEncodedString(const wchar_t *text, size_t cWide, DWORD cp,
                         DWORD flags, std::string *pValue, bool *lossy) {
+  DXASSERT_NOMSG(cWide == ~(size_t)0 || cWide < INT32_MAX);
+  if (text == nullptr || pValue == nullptr || cWide == 0 || cWide >= INT32_MAX)


Since cWide is set to ~(size_t)0 elsewhere, and this is guaranteed to be larger than INT32_MAX, so the last case seems a bit problematic.

Fix unicode conversion bugs for *nix

2198e7c

github-project-automation bot added this to HLSL Roadmap Jun 3, 2025

github-project-automation bot moved this to New in HLSL Roadmap Jun 3, 2025

llvm-beanz approved these changes Jul 11, 2025

View reviewed changes

tex3d added 2 commits August 29, 2025 10:40

Replace (size_t)-1 with ~(size_t)0 to avoid unsigned cast of -1

f1acf3d

Remove OOB index assert, catch more invalid uses/overflows

6173c2a

bogner approved these changes Sep 10, 2025

View reviewed changes

damyanp mentioned this pull request Sep 12, 2025

[SER] Incomprehensible error from initializing const payload in a certain way and using it in Invoke #7761

Closed

igaryhe reviewed Oct 11, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix potential unicode conversion issues for *nix #7506

Fix potential unicode conversion issues for *nix #7506

Uh oh!

tex3d commented Jun 3, 2025 •

edited

Loading

Uh oh!

llvm-beanz left a comment

Uh oh!

tex3d commented Jul 11, 2025

Uh oh!

bogner Sep 10, 2025

Uh oh!

igaryhe commented Oct 11, 2025

Uh oh!

igaryhe Oct 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		if (cbMultiByte == 0 \|\| cbMultiByte < -1 \|\| cbMultiByte > (INT32_MAX - 1) \|\|
		cchWideChar < 0 \|\| cchWideChar > (INT32_MAX - 1)) {

Fix potential unicode conversion issues for *nix #7506

Are you sure you want to change the base?

Fix potential unicode conversion issues for *nix #7506

Uh oh!

Conversation

tex3d commented Jun 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

llvm-beanz left a comment

Choose a reason for hiding this comment

Uh oh!

tex3d commented Jul 11, 2025

Uh oh!

bogner Sep 10, 2025

Choose a reason for hiding this comment

Uh oh!

igaryhe commented Oct 11, 2025

Uh oh!

igaryhe Oct 11, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

tex3d commented Jun 3, 2025 •

edited

Loading